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ABSTRACT 

Clusters of galaxies usually contain rich populations of globular clusters (GCs). We investigate how different star formation histories 
(SFHs) shape the final mass distribution of star clusters. 

We assumed that every star cluster population forms during a formation epoch of length St at a constant star-formation rate (SFR). 
The mass distribution of such a population is described by the embedded cluster mass function (ECMF), which is a pure power law 
extending to an upper limit Mn,ax- Since the SFR determines M^ax, the ECMF implicitly depends on the SFR. 

Starting with different SFHs, the time-evolution of the SFR, each SFH is divided into formation epochs of length St at different 
SFRs. The requested mass function arises from the superposition of the star clusters of all formation epochs. An improved optimal 
sampling technique is introduced that allows generating number and mass distributions, both of which accurately agree with the 
ECMF. Moreover, for each SFH the distribution function of all involved SFRs, F(SFR), is computed. For monotonically decreasing 
SFHs, we found that F(SFR) always follows a power law. 

With /"(SFR), we developed the theory of the integrated galactic embedded cluster mass function (IGECMF). The latter describes 
the distribution function of birth stellar masses of star clusters that accumulated over a formation episode much longer than St. The 
IGECMF indeed reproduces the mass distribution of star clusters created according to the superposition principle. Interestingly, all 
considered SFHs lead to a turn-down with increasing star cluster mass in their respective IGECMFs in a similar way as is observed 
for GC systems in different galaxy clusters, which offers the possibility of determining the conditions under which a GC system was 
assembled. 

Although assuming a pure power-law ECMF, a Schechter-like IGECMF emerges from the superposition principle. In the past decade, 
a turn-down at the high-mass end has been observed in the cluster initial mass function. This turn-down can be explained naturally if 
the observed star cluster ensembles are superpositions of several individual star cluster populations that formed at different times at 
different SFRs. 
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1. Introduction 

Galaxy clusters form through the coalescence of galaxy groups 
and the infall of individual galaxies. In the course of time, the 
number of galaxies in a galaxy cluster steadily grows, thereby 
increasing the density of galaxies and enlarging the probability 
for galaxy-galaxy encounters during which star formation takes 
place. Thus, galaxy clusters and their major galaxies were as¬ 
sembled from many such encounters in which new stars and star 
clusters (SCs) were formed, as well as through the accretion of 
gas, stars, and SCs. 

During such collisions, star formation is expected to occur at 
a higher rate than observed in today’s Universe since the early 
galaxies were notably gas-rich. In these events, SCs of a wide 
mass range were formed, and due to the high star-formation 
rates (SFRs), a substantial number of high-mass SCs were able 
to form (e.g., Larsen & Richtler 2000| l. Even if massive stars 
have short lifetimes and low-mass SCs dissolve fast in a tidal 
field, at least the high-mass SCs have a considerable chance to 
survive over a Hubble time, enabling us to observe them today 
as globular clusters (GCs). 

Our idea is to use the surrounding SCs - in this case, the 
ancient GCs that probably formed during the above-mentioned 
interactions - to derive the star formation activities at that time 


and to determine under which conditions the major galaxies, and 
in the end, the host galaxy cluster itself, were assembled. The 
overall mass distribution of these GCs may show features that 
bear a memory of such events because in galaxy-galaxy inter¬ 
actions SC populations are formed during bursty phases, while 
infalling galaxies contribute SC populations that largely formed 
under quiescent conditions. Other collaborations also use SCs to 


constrain properties of the host galaxy (e.g., Cote et al. 1998 


Maschberger & Kroupa| 

2007[ Beasley et al. 2008 

Norris & 

Kannappan 201 1[ Georgiev et al. 1 201211. To this enc 

, it is in- 


vestigated how the star formation activities influence the mass 
distribution of SCs to provide the theoretical groundwork for an¬ 
alyzing SC systems in the future. 

What shapes the mass distribution of SCs? Since the major 
galaxies in a galaxy cluster have probably undergone several in¬ 
teraction processes with intense star formation episodes in the 
past, an SC sample observed around such a galaxy is most likely 
a superposition of different SC populations formed at different 
times. From SC formation it is known that the mass distribution 
function of a newly born SC population that formed coevally 
can be described by the so-called embedded cluster mass func- 
tion (ECMF) (e.g.,|Lada & Lada|2003|rKroupa & Weidner|2003[ 


[Weidner & Kroupa|20051l. We here assumed that SC formation 
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occurred information epochs of length 6t at a constant SFR dur¬ 
ing which the ECMF is fully populated. 

In the following, we derive the overall mass distribution 
of SCs for SC populations that formed during a Hubble time 
of SC formation, for instance, around an individual galaxy. 
Analogously to the ECMF, this mass distribution function is 
called the integrated galactic embedded cluster mass function 
(IGECMF). It can be obtained by summing the ECMFs of all 
star-formation epochs that occurred during the different galaxy- 
galaxy encounters. However, the involved ECMFs are not iden¬ 
tical; According to the SFR-Meci,max relation (|Weidner et al. 
|2004| l, the higher the SFR, the higher the mass of the most mas- 
sive SC, Meci.max^ ^nd vice versa. Hence, during an SC formation 
epoch with a high SFR, more and heavier SCs form than within 
a period with a low SFR. Therefore it is necessary to scale the 
involved ECMFs by the respective SFR before summing them. 
Thus, one has to know how the SFRs of the different formation 
epochs are distributed. This distribution function of all involved 
SFRs is called F'(SFR). Mathematically, to obtain the IGECMF, 
one has to integrate the ECMF scaled by E(SFR) over all possi¬ 
ble SFRs. Physically, one has to sum all SC populations resulting 
from the different formation epochs over time. 

The distribution function of SFRs, E(SFR), is here quanti¬ 
fied from a total of six star formation histories (SFHs): i) two 
exponential SFHs ( Gavazzi et al.||200^ and ii) four power-law 
SFHs. The latter have not been investigated until now, but by 
analyzing them, a fundamental relation between an SFH and its 
E(SFR) is discovered. From the obtained E(SFR), we determine 
the desired IGECMF. 


As mentioned previously, the IGECMF describes how the 
masses of newly born SCs are distributed and must not be con¬ 
fused with the mass distribution function of SCs observed to¬ 
day, the so-called present-day mass function. The reason is that 
each distribution of SCs will suffer from changes driven by stel¬ 
lar and internal dynamical evolution as well as dissolution and 
SC disruption due to relaxation, dynamical friction, or tidal ef¬ 
fects. Starting with an IGECMF, the present-day mass function 
can be derived by taking into account all these phenomenons. 
Since the destructive processes are studied in detail elsewhere 


(e.g., IBaumgardt & Makino] 20031 |Lamers ^1-1 

2005al 120101 

2013 

lAlexander & Gieles||20121 Alexander et al. 

20141 Gieles 

et al. 

2011 

2014 |Brockamp et al. 2014| see also Portegies Zwart 

et al. 

2010 

and references therein), the subsequent investigations 


are restricted on the determination of the IGECMFs, which re¬ 
veal how the birth stellar masses of SCs will theoretically be 
distributed as a function of different SFHs. To this end, a new 
optimal sampling technique is developed that allows sampling a 
population of SCs that ideally resembles the underlying distribu¬ 
tion function in terms of the number of objects and their mass. 


1.1. Summarized approach 

After introducing the necessary background in Sects. |^an d0 we 
develop in Sects. [^-[Tjhow the theoretical overall mass distribu¬ 
tion of SCs is shaped after a star formation period of a particular 
length. Here we list the main procedure steps and where they are 
described in detail; 

1. Different SFHs are assumed that reveal how the SFR evolved 
over a Hubble time (Sect. |^. Based on the assumption that 
SC formation occurs in epochs of duration 6t (Sect. |2.2| de¬ 
termination in Sect. [^, the SFH can be divided into individ¬ 
ual formation epochs each at a particular SFR (Fig.|^. 


2. For each formation epoch the mass of the most massive SC, 
Med,max, Can be deduced according to the SFR-Mgci.max rela¬ 
tion (Eq. 1^ Eig. 1^ that fully determines the ECME of that 
formation epoch (Sect. [2^ . Thus, the only independent pa¬ 
rameter is the SER that determines all other quantities. 

3. The overall mass distribution of SCs after a certain period 
of SC formation - here a Hubble time - emerges from the 
accumulation of all SC ever formed. This approach is called 
the superposition principl^ Two different methods are used 
to obtain the final SC mass distribution of birth stellar masses 
(Sect.|3; 


(a) 


(b) 


Eirst, the superposition of the ECMFs of all formation 
epochs modulated by the distribution function of SFRs, 
E(SFR) (definition in Sect. |2.3| determination in Sect. 
results in Fig. [7]i, leads to the IGECMF (definition in 
Sect. 2.3 determination in Sect. results in Fig. 0. 
Discretizing the IGECMF gives the overall mass distri¬ 
bution of SCs (continuous lines in Fig.fTOll. 

First, the ECMF of each formation epoch is discretized 
into one individual SC population using an accurate sam¬ 
pling technique (Sect. ED- Then the superposition of all 
these SC populations leads to the final SC mass distribu¬ 


tion (dashed lines in Fig. 101. 


We show that the results from (a) and (b) agree. 


To treat this accurately, some mathematical derivations are 


required (especially Sects. 2.1 - 2.3 i. To facilitate the under¬ 


standing of this approach, we recommend reading Sects. |2i4| and 
|2.5| in which we explain how the overall star cluster mass func¬ 
tion emerges based on different SFHs. This is additionally de¬ 
picted in Fig.|^ which visualizes how the superposition principle 
works. 

The key finding of this work is that the overall mass function 
of SCs is expected to exhibit a turn-down at the high-mass end if 
the SFR changed significantly during the considered SC forma¬ 
tion timescale. Since the mass of an SC changes during lifetime, 
an observed SC mass distribution cannot be directly compared to 
a theoretical SC mass distribution. Thus, we recommend reading 
Sect.|^ which summarizes to which extent the predicted shape of 
the SC mass function can be observed and which corrections are 
necessary to enable this comparison in order to obtain reasonable 
results. Finally, we conclude our investigations in Sect.|^ 


2. Underlying theory 

In this section, we first introduce a significantly improved opti¬ 
mal sampling technique in Sect. |2.1| Then we define the ECMF 
from which the SFR-Mgei max relation is derived in Sect. 12.21 The 


approach used here is similar to that described by Weidner et al. 
( |2004| l and Maschberger & Kroupa| ( |20071 l. In Sect. |2 3| we de- 
scribe the theory of the IGECMF, followed by an exemplifica¬ 
tion of how the IGECMF emerges from the ECMF for different 
star formation activities in Sect. 12.41 The distribution function of 


SFRs, E(SFR), and the SFH are compared in Sect. 2.5 


2.1. Improving optimal sampling 


A simple and commonly used method of discretizing a parental 
distribution function is by random sampling. The distribution 


’ Similarly, Kroupa 1 2002 1 accumulated the velocities of all stars of 
all embedded SCs in a galaxy to gain their vertical velocity field, while 
|Kroupa & Weidn^ ( |2003| l obtained the integrated galactic IMF from 
the masses of these stars. 
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function is perceived as a probability distribution function from 
which values are diced using a generation function. Such an 
ensemble of sampled values naturally shows statistical devia¬ 
tions from the parental distribution function ( |Kroupa et al.|2013] l. 
However, for a wide range of applications it is necessary to ac¬ 
curately extract the number of objects as well as their individual 
masses from an arbitrary distribution function. This mass distri¬ 
bution function, denoted by ^(M), is described as 




r'wwtot 


M^(M) dM + 



M^(M) dM + ... H- 



Mf(M) dM + ... H- 



M^(M) dM -H 



M^(M) dM, 
(5) 


AM) = -, (1) 

where dN is the number of objects in the mass interval M to 
M -I- dM. The particular functional form of the distribution func¬ 
tion, ^(M), is specified below and is not required for the general 
approach. 

The outcome of any sampling technique should reproduce 
the number distribution, dAf/dM, as well as the mass distribu¬ 
tion, M dN jdM, as precisely as possible. If this is the case for 
a sample of objects, the corresponding distribution function is 
called ’’fully populated” throughout this paper. Moreover, for 
any lower and upper mass limits, M^in and M^ax, respectively, 
the total number of objects, Ntot, and the total mass, Mtot, should 
agree with the analytical computation of A^tot as well as of Mtot 
at the same time: 


since in total there are A^tot objects. With m\ - M^ax, the index 
of the upper limit, i, of each separate integral enumerates the 
individual objects. Thus, each separate integral must fulfill the 
two following requirements: 


1. Each integral must give one object. Integration of ^(M) 
within the limits m, and m,+i yields exactly unity: 


1 = 



( 6 ) 


2. Then the mass of this /-th object, M,, is determined by 


Mi = 


prrii 

Jm,+i 


M ^(M) dM, 


(7) 


where the limits m, and m,+i have to be equal to those in 
Eq.|6] 


-^tot ~ 




f(M) dM, 


M f(M) dM. 


( 2 ) 

(3) 


Thus, the quality of a sampling method can be measured by how 
accurately the outcome resembles dNjdM and M dNjdM and 
how well the actual values for A^tot and M^t agree with the ana¬ 
lytical values (Eqs.|^and|^. Compliance with one of these four 
conditions does not imply that one or all other conditions are 
fulfilled as well. 

One technique used to do this is the optimal sampling 
method developed by Kroupa et al. (20131 and later incorpo¬ 
rated into the extended software package originally published 
by jPflamm-Altenburg & Kroupa|(|2006|l. It is designed to gen¬ 
erate a population of stars from the initial mass function (IMF). 
The procedure requires the analytical form of the IME, the phys¬ 
ical upper mass limit for stars, m^ax^ and the total stellar mass 
of the embedded SC, Med. As shown by Kroupa et al.| ( 20T3] l, 
their Eig. 1, optimal sampling nicely reproduces the shape of the 
IME, ^iMF(Af) = dN jdM, without introducing any Poisson noise. 
However, a closer look reveals that optimal sampling does not 
fulfill Eq. as we show below. 

Is it possible at all to devise a sampling technique that fulfills 
all four conditions and works without adding stochastic fluctua¬ 
tions to the outcome? It is, as we develop in the following. 

Starting with Eqs.j^an we divide both integrals into Atot 
separate integrals, each integral representing one individual ob¬ 
ject: 


These two requirements ensure that the number distribution, 
dN jdM, and the mass distribution, M dN jdM, are reproduced 
and that Atot and Mtot agree with the analytical values. Since 
nii+i < nii with increasing number i the objects become less mas¬ 
sive. 

As our work considers the formation of SC distributions, our 
task is to generate an ideal population of SCs. However, the un¬ 
derlying concept is so general that it can be applied to any other 
type of object. Eor the sake of simplicity, we assumed that the 
mass distribution function of SCs follows a one-part power law 
with the index p 


^(M) = k 


M 

Mn,„ 


( 8 ) 


within the lower and upper mass limit, Mmin and M^ax, respec¬ 


tively. k is a normalization constant. Similarly to Weidner et al. 
(|2004|l, this function is normalized as follows: 


1 = 



(9) 


with a truncation mass Mtrunc = This leads to a normalization 
constant, k. 


k^(p-\)M-^^. ( 10 ) 

Here and in all following equations, >6 > 1 must be fulfilled. 
Otherwise, the antiderivatives of p(M) and M ^(M) cannot be 
computed. 

These ingredients enable computing the individual masses of 
the SCs. Equationj^implies for the {i + l)-th integration limit of 
any of the separate integrals from Eq. 


Aft = ^(M) dM -H ^(M) dM -h ... -h 

pm-, pmi pA/max 

^(M)dM + ...+ ^(M)dM+ f(M)dM, 


(4) 


pnii 

1= f(M) dM = 

Jm/+i 

n = {mp + m]^^ with mi = M^ax, 


( 11 ) 
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which allows iteratively determining the integration limits of all 
separate integrals in Eq. With these, the individual masses of 
all SCs of the ideal population can be computed, so that the i-th 
SC has a mass of 


Mi = 


f 

*Jmi. 


Mrr 




M ^(M) AM = 

(Inm; - lnm,+i) ,/? = 2 


( 12 ) 


- mf-f) , /? ^ 2, 


Moreover, using Eq.[^ the expected total number of objects, Wot, 
and their total mass, Mtot, as in Eqs.|^and[^ can be evaluated 
analytically by replacing the lower and upper integration limits 
in Eqs.[TT] and [T^ with M^in and Mmax, respectively: 


Mo 


Wot = f(M) dM = [ 

\ Mo 


S-l 


-1, 


(13) 


,^tot — 


f 


'^max 

ttot- I M^(M)dM 


[ Mtnax (lit Mitiax lit Moiin) , ^ “ 2 


,13*2. 


Mo 


I 

i-p 


P \ (Mmax) j 


(14) 


We illustrate the performance of the introduced sampling 
technique and the comparison to the original optimal sampling 
method ( Kroupa et al.|201^ with an exemplary calculation. To 
quantify this comparison, we generated SCs with both sampling 
techniques based on the following assumptions: 


Mass distribution function, ^(M): The masses of the SCs are 
distributed according to Eq.j^with an index (3 - 2.0. Eor any 
other value for the index, one obtains the same qualitative 
results, therefore these results are omitted here. 

Lower and upper limits: The lower limit for SCs is assumed 
to be constantly M^in = 5 Mq. The upper limit is varied in 
the range 10 Mq < M^ax <10^ Mq. 

Normalization: 

• New improved optimal sampling method: Using Eq. 
implies Eq. 10 so that the normalization solely depends 
on Mmax because (3 is fixed. 

• Optimal sampling: The normalization is calculated as 
Pflamm-Altenburg & Kroupa ( 2006|l and requires the 


in 

lower and upper limits, Mjj,in and M^ax and the total mass 
of the population, Mtot- Thus, the analytical expectation 
for Mtot from Eq. 14 is provided as an input variable. 

The normalizations of both sampling techniques lead to 
quantitatively the same ^(M) if Mmin, Mmax, and Mtot are 
equal for both techniques. 


Populations of SCs were generated for the above range 
of Mtnax using both sampling techniques. Eor an exemplary 
case with M^ax = 10000 Mq, the distributions chV/dM and 
M ANjAM are shown in Eig. [T] The result of optimal sampling 
is indicated with a dashed line, while a continuous line is used 
for the result of the new sampling technique. The parental dis¬ 
tribution function, ^(M) and M f(M), is overplotted with a dot¬ 
ted line. Clearly, both sampling methods reproduce the parental 
function even if the resulting distributions differ slightly from 
each other. Eor lower or higher values of M^ax, these distribu¬ 
tions are shifted to lower or higher masses, but look qualitatively 



10 100 1000 10000 
M[M0] 


Fig. 1. Comparison of the resulting number distribution, AN! AM, 
(upper panel) and mass distribution, M ANfAM, (lower panel) 
obtained from optimal sampling (dashed lines) and the new sam¬ 
pling technique (continuous lines). The underlying mass distri¬ 
bution function, ^(M), is overplotted with a dotted line. Here, an 
exemplary case with M^ax = 10000 Mq is shown. 


the same. Erom this point of view, none of the sampling tech¬ 
niques can be favored over the other. 

In addition, the resulting total numbers of SCs, Wot, and their 
total masses, Mtot, of both sampling methods are compared to 
the analytical expectations from Eqs. [13 and Eor each run 
we calculated the absolute deviations with respect to Wot, |MV|, 
and to Mtot, |AM|. They can be found in Table [^together with the 
results. The absolute differences, |AN| and |AM|, obtained from 
optimal sampling are marked with crosses, while those from the 
new sampling technique are drawn as open circles in Eig. [^ 
The acceptable deviation is one SC of the lowest mass, that 
is, AN = 1 and AM = Mmin = 5 Mq because it cannot be guar¬ 
anteed that the expectation for Wot (Eq. [H will give an exact 
integer value. These thresholds are indicated with dashed lines 
in Eig. [^ Every data point that lies below this is compliant with 
the analytical expectation (Eqs.[T3]and[T4ll. Eor a certain combi¬ 
nation of parameters {j3 = 2.0, Mmin = 5 Mq, Mmax = 10' Mq 
with X being a natural number), a natural number is analytically 
obtained for the total number of SCs, Wot (Eq. [T3] l. The new 
sampling technique always leads to a discrete Wot (cf- Table [^, 
so that for the upper combination of parameters it simply exactly 
reproduces the analytical expectation for Wot and M^t. In these 
cases, the deviations |AN| or |AM| are thus zero and are plot¬ 
ted at 0.01 in Eig. [^because otherwise these data points will be 
missing since we used a logarithmic scale. 
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Table 1. Comparison of optimal sampling and the new sampling technique in terms of their absolute differences, |AAf| and |AM|, 
from the analytical expectations for Ntot and Mtot (Eqs.[T3]and[T4|) as a function of the upper mass limit, M^ax- All deviations above 
the acceptable level (AN - 1 and AM = 5 Mq) are shown in bold. All values are rounded. 



analytical value 

optimal sampling 

new sampling method 

M„ax 

A,o, 

^tot 

Ato, 

|AA| 

^tot 

|AM| 

A« 

|AA| 

^tot 

|AM| 

[Mo] 


[Mol 



[Mo] 

[Mo] 



[Mo] 

[Mo] 

21.5 

3.3 

31.5 

4 

0.7 

31.0 

0.5 

3 

0.3 

29.9 

1.6 

46.4 

8.3 

103.4 

9 

0.7 

99.9 

3.5 

8 

0.3 

102.0 

1.4 

100 

19 

299.6 

21 

2 

298.1 

1.5 

19 

0 

299.6 

0.0 

215.4 

42.1 

810.8 

45 

2.9 

807.3 

3.5 

42 

0.1 

810.3 

0.4 

464.2 

91.8 

2 103.0 

97 

5.2 

2 100.2 

2.8 

91 

0.8 

2098.8 

4.2 

1000 

199 

5 298.3 

208 

9 

5 296.2 

2.1 

199 

0 

5 298.3 

0.0 

2 154.4 

429.9 

13 068.5 

446 

16.1 

13 068.1 

0.4 

429 

0.9 

13 064.0 

4.4 

4641.6 

927.3 

31717.7 

956 

28.7 

31716.1 

1.6 

927 

0.3 

31716.1 

1.6 

10000 

1999 

76009.0 

2052 

53 

76 005.0 

4.0 

1999 

0 

76 009.0 

0.0 

21 544.3 

4 307.9 

180292.4 

4410 

102.1 

180290.4 

2.0 

4 307 

0.9 

180288.0 

4.3 

46415.9 

9 282.2 

424053.7 

9481 

198.8 

424051.9 

1.8 

9 282 

0.2 

424052.8 

0.9 

100000 

19 999 

990348.8 

20391 

392 

990347.0 

1.7 

19 999 

0 

990 348.8 

0.0 

215 443.5 

43 087.7 

2 299 000.7 

43 868 

780.3 

2 298 998.8 

1.9 

43 087 

0.7 

2 298 997.2 

3.5 

464 158.9 

92 830.8 

5 309 301.9 

94 395 

1564.2 

5 309 297.7 

4.2 

92 830 

0.8 

5 309 298.1 

3.9 

1000000 

199 999 

12 206072.6 

203154 

3155 

12 206 070.8 

1.8 

199 999 

0 

12 206072.6 

0.0 

2154434.7 

430 885.9 

27 950776.1 

437 276 

6390.1 

27 950776.5 

0.4 

430 885 

0.9 

27 950771.4 

4.7 

4641588.8 

928 316.8 

63 780672.7 

941318 

13001.2 

63 780665.3 

7.4 

928 316 

0.8 

63 780668.8 

3.8 


For \AM\, both approaches lead to a total mass, Mtot, which 
agrees with the analytical expectation from Eq.[T4| (lower panel 
of Fig. 1^ Table T]). However, we note that at the largest M^ax 
the data point belonging to optimal sampling has |AM| = 7.4 Mq 
(T able [^1 and is therefore located slightly above the threshold in 
Fig. 1^ The reason for this is unclear, but it might be related to 
the accumulation of tiny random deviations. 

For |AAf|, the upper panel of Fig. shows that in all cases 
the total number of SCs generated by the new sampling tech¬ 
nique agrees with the analytical expectation (Eq. [Ol. However, 
this hnding does not apply to optimal sampling: All runs with 
Mjjiax ^ 100 produ-CS more SC-s tli3.ri expected from 


As can be seen, the surplus number of SCs increase with increas- 
ing Mmax (Table [T] upper panel of Fig.[^. 

In conclusion, optimal sampling is not optimal in the sense 
that it does not lead to the correct total number of SCs, Atot, 
even if it reproduces the total mass, M^t, and the sampled SCs 
are distributed according to the parental distribution function 
(Fig. 0. In contrast, we introduce here for the first time a sam¬ 
pling method where the outcome is able to resemble the underly¬ 
ing distribution function in terms of dA/dM and M dNfdM and 
has a total number of SCs, Atot, and a total mass, Mtot, which 
fully agrees with the analytical expectations (Eqs. 13 and 14 1 . 
Thus, the introduced new sampling method can be termed im¬ 
proved optimal sampling. 

We add that the normalization (Eq. may seem to be arbi¬ 
trary since there is no physical reason why the integration of a 
distribution function between the theoretical upper mass limit, 
Mtnax, and the ttTincation mass, Mtmnc = should exactly yield 
unity. However, this mathematical step is essential since it allows 
Mmnx to vary: Here, M^ax cannot be treated as a constant quan¬ 
tity since it depends on the SFR, as we derive in Sect. |2.2| For 
instance, for a stellar IMF, the most massive star, is deter¬ 
mined by the total stellar mass of the SC, Med, according to the 
'Wmax - Med relation ( |Weidner et al.|2010b||20i3l l, while the stel¬ 
lar truncation mass would be Mxmnc = 150 Mq (e.g., [Weidner^ 
Kroupa|2004 1 . This already shows the universal applicability of 


the new sampling technique: For stars, the underlying distribu¬ 
tion function would be a multiple-part power-law IMF to which 


the new sampling technique can be applied in the same man¬ 
ner as for SCs with an ECMF defined by a one-part power law. 
More generally, this method can be used to sample any type of 
object that can be represented by a distribution function that is 
a piecewise power law and where avoidance of Poisson noise is 
desired. 

The following instruction describes how a population of ob¬ 
jects can be created with the new sampling technique: 


1. Dehne the (mass) distribution function, ^(M), and its lower 
and upper limits, M„xm and M^ax- Remember that for SCs 
Mmax depends on the SFR (Sect. 2.21, while for stars the most 
massive one, mmax, is determined by the total stellar mass of 
the SC, Med, according to the m^ax “ A^eci relation. 

2. Apply a normalization. Here, we assumed Eq. but other 
normalizations are conceivable as well. For stars, the as¬ 
sumption of a certain Med directly normalizes ^(M). 

3. Determine the integration limits of Eq. |^in consideration 
of Eq. 1^ If the antiderivative of Eq. cannot be calculated 
analytically, the integration limits can also be computed nu¬ 
merically by starting at M^ax and integrating downward. If 
a one-part power law with index ji is assumed and the same 
normalization is used (Eqs. [9|and [T0|), all integration limits 
can be directly calculated from Eq.|l 1| 

4. Using the integration limits, compute the individual masses 
of all the objects from Eq.|^ Again, this can be done numeri¬ 
cally if the respective antiderivative does not exist. Using the 
same assumptions as we did here, the individual masses can 
be directly calculated from Eq. 


The advantages of the new improved optimal sampling tech¬ 
nique are that it fulhlls all of the four conditions stated at the 
beginning of this section, meaning that the number distribu¬ 
tion and the mass distribution match the underlying mass dis¬ 
tribution function. In addition, the total number of objects, Atot, 
and their total mass, M^t, match analytical expectations (Eqs.|^ 
and|^. As requested, the outcome is free of stochastic fluctua¬ 
tions. Moreover, if the antiderivatives of dA/dM and M dA/dM 
(cf. Eqs. 0-0 can be calculated analytically, then the whole 
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Fig. 2. Comparison of the resulting difference, |AAf| and |AM|, 
obtained from optimal sampling (crosses) and the new sampling 
technique (open circles) with respect to the analytical expecta¬ 
tion for A^tot and Mtot (Eqs.[T3]and[l4|. The dashed lines indi¬ 
cate up to which level the deviations are acceptable (AN = 1 and 
AM - 5 Mq). The corresponding values can be found in Table[T] 
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sampling can be performed analytically as well, which saves 
computational time. 


2.2. Embedded cluster mass function (ECMF) 

The ECMF is the mass distribution function of young, embedded 
SCs that were formed during one star cluster formation epoch 
(SCFE). Observations suggest that the stellar masses of young 
SCs are distributed according to a power law with index y6: 




thVECMF 

AM 


- k 


M 

^max 


n 


(15) 


Mmax is the stellar upper mass limit for SCs formed during one 
SC formation epoch, k a normalization constant, and -p the 
slope of the ECMF lying in the range 1.6 < /? < 2.5 (direct mea- 


surements:|Zhang & Fall (|19991l;|Bik et al. 

20031l;|de Grijs et al. 

(|2003|l; Hunter et al.|(|20031l; Lada & Lada 

( 200311; IFall (|20041i; 

Gieles et al. (|2006ai; de Grijs & Anders 

(200^; McCrady & 

Graham (|2007 1 ; de Grijs & Goodwin (|^ 

1008|l; Dowell et al. 

(|2008|l;|Whitmore et al.lPOlOJl; Chandar et al. pOlOj 201 IJl; de- 

rived from models: e.g.,|Kroupa & Boily ( 

2002 J; Weidner et al. 

(2004|l; see also de Grijs et al. P003|l, their Table 2, for slopes 


of the cluster luminosity function for different galaxies). 


It is debated whether the ECMF is a pure power law 
d Whitmore eraljlSOOTl |20T0l [Chandar et aT]|20T0l [2mT] ) or 


has a fundamental upper limit like a cutoff or an exponential 
turn-over at the high-mass end, which can be described by a 
Schec hter function (|Gieles et al.|2006a|bt |Bastian|2008[ |Larsen| 
|2009HBastian et al.|2012a|b| l. A differentiation between the two 

types is very difficult because of the low number of high-mass 
SCs (e.g., |Bastian||2008[ Bastian et al.||2012a l. If the ECMF is 
indeed truncated, [Haas & Anders] ( |201()[ ) did not expect the pre¬ 
cise shape at the high-mass end to be important. They investi¬ 
gated how the choice of the sampling technique and the index 
of the ECMF alters the integrated galactic initial mass function 
(IGIMF) - the analogon of the IGECMF for stars instead of SCs. 
They pointed out that an exponential turn-down and a trunca¬ 
tion of the cluster mass function will have a similar effect on the 
IGIMF, for which reason the precise shape of the ECMF is not 
expected to be important. 

[Bonatto & Bica| ^2012) , for instance, simulated how a 
Schechter-type initial cluster mass function of galactic GCs 
evolves due to stellar evolution and dynamical mass-loss pro¬ 
cesses into a present-day mass function for different MjL de¬ 
pendences on luminosity. Interestingly, the most realistic results 
were obtained for MjL ratios increasing with luminosity - as is 
observed for GCs - with a truncation mass of M^^wc ~ 10*° Mq, 
which means, effectively a pure power law without an upper 
limit. On the other hand, there might exist an upper mass limit 
for SCs since they form out of giant molecular clouds (GMCs) 
whose mass function is truncated at the high-mass end at least in 
M3 3, as reported by |Rosolowsky et ahlpOO? i. 

Since a completely limitless ECMF is unphysical, it is as¬ 
sumed that there is a theoretical upper mass limit for SCs, M^ax, 
which is not a fixed value, but depends on the SFR, as we derive 

(|2004 1 , we take a lower mass 


below. Following Weidner et al. 


limit for newly born SCs of Mmin = 5 Mq. The ECMF (Eq. 15 1 
was assumed to be a pure power law ranging from Mmin to the 
cutoff mass, M^ax. beyond which SCs cannot be formed. All 
following derivations are based on the findings from Sect. |2.1| 
meaning that the same normalization and the new sampling 
method were used so that all results obtained there are appli¬ 
cable here. The choice of a deterministic sampling technique is 
motivated by Pflamm-Altenburg et al. ( |2013 i and Kroupa ( 2015| 
see also references therein) since a self-regulated rather than a 
probabilistic or stochastic description of the emergence of an SC 
population out of a dense molecular cloud is consistent with the 
data. Thus, the total number of young SCs, A^ecmf. of one SC 
formation epoch is given by Eq. 13 


A^ecmf = I ^ecmf(M) dM : 

-^min 


M„ 


M„ 


I3~l 


1 . 


(16) 


According to Eq. 14 the total stellar mass of a young, embedded 
SC population, Mecmf. formed during one SC formation epoch, 
is determined by 




Ml 


ECMF 



M fECMF(m) dM 

(In Afrnax Itt iff min) 

1 /l _ ( 

B y j 


, P+ 2. 


(17) 


For all following computations we assumed the following for 
the SC formation process: 

1. During one SC formation epoch, all SCs and the stars therein 
form coevally and represent a single-age SC population. The 
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SC masses of this young SC population are always dis¬ 
tributed according to the ECMF (Eq. 151 within the limits 
Mmin and implying that the ECME is fully (or ’opti¬ 

mally’) populated (Sect. HI}- 

2. The index p of the ECME does not change with time. 

3. An SC formation epoch is of duration 6t, which is not a func¬ 
tion of time. 

4. During an SC formation epoch, the total mass of the young 
SC population, Mecmf, is formed at a constant SER; 


il^ECMF - SER ■ 6t. 


(18) 


SER = 



(19) 


Since Mmin, jS, and St are treated as constant quantities, the SER 
(Eq. is determined by M^ax alone. Because the ECME is 
a function of M^ax and Mmax is correlated with the SER, the 
ECME implicitly depends on the SER: 

^ECMF(i'T) = ^ECME,SFR(ilfmin < M < M[nax(SER)). (20) 

Regrettably, the theoretical upper mass limit for SCs of a par¬ 
ticular SC formation epoch, M^ax, is very hard to determine. 
However, the mass of the most massive SC of the same SC for¬ 
mation epoch, Med,max, Can be estimated. The ansatz of the new 
sampling technique enables relating the theoretical upper mass 
limit, Mmax, and the mass of the heaviest SC, Med,max, to each 
other. In the first condition (Eq. |^, / = 1 is assigned to the most 
massive SC since the SCs become less massive with increasing i 
(SeCt.|2.l|l, so OT/ - mi - Mmax 




^ecmf(M) dM - 


m 2 

Mma, 


i-13 


- 1. 


Solving for m 2 gives 


m 2 — 2 '-P M„ 


( 21 ) 


( 22 ) 


According to the second condition (Eq. |^, the mass of the most 
massive SC, Med.max, is determined by the integration limits 
from Eq.j^and replacing 1112 with Eq. [^results in 


Med,n 


/ 


'•^max 

M ^ecmf(M) dM 
m 2 

(In 2) Mmax 

(1-2^) Mmax 


y8 = 2 


(23) 


Inversely, the upper mass limit for SCs of one SC formation 
epoch, Mmax, as a function of the observed most massive SC, 
Mgd.max, reads 


which allows relating Med.max and SER to each other by replac¬ 
ing Mmax in Eq. with Eq. 24 Erom this arises the so-called 
SER-Med,max relation: 


SER = 


•^ecl,max 

5f-ln 2 

(ln("i;r) 


•'^ec^max 

(l-23)-(l-(g( 


6t 

^ ) j 


The total mass of one SC population, Mecme, can be calculated 
from Eq. [^if the lower and upper limit of the SC masses are 
known. Moreover, knowledge about Mecmf and St allows ex¬ 
tracting the underlying SER: Rearranging Eq. 18 using Eq. [T 7 | 
leads to an SER of 


,y8 = 2 

,P*2. 

(25) 


Indeed, observations suggest that Med.max scales with SER (see 
Eig. |4|below) as found for example by Weidner et al. ( 2004| l. In 
SectTT^we analyze this SER-Med,max relation to determine the 
length of one SC formation epoch, St. 


2.3. Integrated galactic embedded cluster mass function 
(IGECMF) 

The purpose of this contribution is to devise a mass distribution 
function describing how the birth stellar masses of young SCs 
are distributed after a formation episode that is much longer than 
one single SC formation epoch of length St. In this respect, it 
does not matter whether SC formation takes place continuously 
or in bursty phases. We assumed that any SC formation episode 
can be divided into a certain number of SC formation epochs 
of length St and that all properties mentioned in Sect. |2.2| apply 
to each epoch. Since the masses of SCs that formed during one 
SC formation epoch are distributed according to the ECMF, the 
superposition of all involved ECMFs will lead to the requested 
mass distribution function. Analogously to the ECMF, this time- 
integrated function is called the IGECMF. The IGECMF reveals 
how the birth stellar masses of SCs are distributed after a cer¬ 
tain SC formation episode, but it does not take into account any 
changes of the individual SC masses afterward. 

The ECMF of each SC formation epoch is determined by 
an individual Mmax (cf. Eqs. 15 and 
pendent on the SER (Eq. 


10 1 . Since Mmax is de- 


19 see also Eq. 20 1 , the distribu 


tion function of SFRs, called E(SFR), is needed to obtain the 
IGECMF. It describes the number of SC formation epochs 
(SCFEs) dAscEE(SFR) per SER interval: 


E(SFR) = 


dA^scFE(SFR) 


(26) 


dSFR 

The IGECMF will arise from the integration of the ECMF 

see 


over the whole range of SFRs in which the ECMF (Eq. 15 


also Eq. 20 1 is modulated by E'(SFR): 


^igecmf(M)^ ^ecmf.sfr(M) E(SFR) dSFR. (27) 

,7SFRn,in 


The resulting IGECMF will have a unique shape because 
E(SFR) carries information about the formation history, which 
is unique for any galaxy or galaxy cluster. Since the ECMF 
implicitly depends on SER (Eq. [20] l and the inverse function 
Mmax(SFR) cannot be calculated analytically (cf. Eq. 19 1 , the 


integration of Eq.|^ cannot be performed directly. For this rea¬ 
son, Mmax is substituted for the integration variable SER so that 


Eq. 27 becomes 


iffmax ~ 



Meal ,max 
-1 


Mgcl r 


,10 = 2 

,P*2, 


(24) ^igecmf(M) 


JmS 


ftcMF(M, Mmax) 


dSFR 




dM, 


(28) 
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with MJJJ" and MXx being the new limits of the integration 
over all possible Mmax corresponding to the lowest and highest 
SFRs, SFRinin and SFR^ax- The transformation from F’(SFR) to 
/^(Mmax) is carried out in Sect.|^ The derivatives of Eg. [T^ are 


dSFR 

dM^a, 



06- 


/ Mmin 

-/5\1 

\ A/niax / 

jj 


.>6 = 2 


(29) 


The integral in Eg. 28 is eguivalent to a summation of ECMEs 
up to their individual F{M^s.id determines how often each 
ECME contributes to the overall IGECME since it reveals how 
often the corresponding SER occurred. Thus, this methodology 
is called the superposition principle. 

It would be convenient to directly extract the total number 
of SCs, Mgecmf. as well as their total mass, Migecmf. from the 
IGECME. In the same way as for the ECME, we use the new 
sampling technigue by applying the criteria in Egs.|^and|^to the 
IGECME: The i-th SC is obtained from the IGECME (Eg. 
if the integrations limits m,+i and m, are chosen such that the 
integral over the IGECME is exactly unity: 


^igecmf(-^) dM . 
nif+l 


Then the mass of the f-th SC is determined by 

pnij 

MsclJ - I M flGECMF(^) dM, 


(30) 


(31) 


where the limits ot,+i and m, must be egual to those in Eg. 30 
Conseguently, the total number of SCs, AficECMF. with masses 
within the limits Mmin and M^ax resulting from a SC formation 
episode of length 6t is given by 


Mgecmf = I ^igecmf(^) dM, 

Afniin 

and its total mass, Migecmf. can be calculated from 


Migecmf 


nMma 

A/min 


M ^igecmf(M) dM. 


(32) 


(33) 


We focus on how the birth stellar masses of all SCs ever 
formed will be distributed after an SC formation episode of arbi¬ 
trary duration. This mass distribution is computed by purely su¬ 
perposing the single-age SC populations of many SC formation 
events, allowing the SER to change with time. Thus, neither stel¬ 
lar nor dynamical evolution leading to mass loss or even to the 
destruction of SCs are taken into account here. However, these 
effects and the impact of the tidal held are discussed in Sect. 
and must be accounted for as soon as the derived SC mass dis¬ 
tributions are compared to observed mass distributions of SCs. 


2.4. From the ECMF to the IGECMF - an exemplification 

Since the subject matter of the previous sections is very theo¬ 
retical, the interrelation between the ECME and the IGECME 
is exemplihed with Eig. in this section. We sketch how dif¬ 
ferent star formation activities (top panels, labeled ’a’) inhu- 
ence the ECMEs (middle panels, labeled ’b’) and thereby shape 
the IGECME (bottom panels, labeled ’c’). A double-logarithmic 
scale is used so that the power-law ECMEs appear as straight 
lines. 


Starting with the easiest case (I), there is exactly one SC for¬ 
mation epoch of length 6t at a relative high SER (la). Converting 
this SER to its respective M^ax (vertical thin dotted line in panel 
Ib) visualizes up to which mass the corresponding ECME is pop¬ 
ulated once - since only one formation epoch occurred (Ib). 
Thus, the respective IGECME (Ic) is egual to the ECME (Ib) 
since just one ECME contributed. 

In the second case (II), the SER remains constant at a 
medium level over ten SC formation epochs, each of duration 6t 
(Ila). Again, converting the SER to its respective Mmax dehnes 
the ECME (Ilb), which is populated ten times in total. Thus, the 
corresponding IGECME (lie) is shifted upward by a factor of ten 
and is truncated sharply at M^ax, which is lower than in the first 
case (1). 

Erom examples (I) and (II) it becomes clear that, on the one 
hand, the level of the SER is relevant since it determines the up¬ 
per mass limit of the ECME, and on the other hand, that how 
often this particular SER appeared constitutes how often the cor¬ 
responding ECME contributes to the IGECME. This is exempli¬ 
fied in the third case (III), where SC formation takes place at 
different SERs and each SER appears a certain number of times 
(Ilia). Each SER has its own respective ECME, which is popu¬ 
lated the number of times the SER occurred (Illb): The ECME 
corresponding to the lowest SER is populated one hundred times, 
while the ECME belonging to the highest SER is populated just 
once. According to the superposition principle, the IGECME 
(IIIc) is composed of all contributing ECMEs. The summation 
has to be carried out in the mass ranges separated by the thin 
dotted lines by taking into account how often each ECME oc¬ 
curred: In the highest mass range, only the ECME correspond¬ 
ing to the highest SER has to be considered, while in the lowest 
mass range all ECMEs contribute. Thus, one observes jumps in 
the IGECME (IIIc). 

The most realistic case is presented in the last column (IV). 
As in (Ilia), SC formation takes place at different SERs and a 
different number of times (IVa), which dehnes how often each 
corresponding ECME will be populated (IVb). Superposing all 
these ECMEs - each multiplied by the number of formation 
epochs of the respective SER - leads to the IGECME (IVc). 
Since the summation has to be performed separately in each 
mass range, the IGECME exhibits jumps, as in the case before 
(IIIc). 

If even more SC formation epochs occur at SERs lying be¬ 
tween the considered ones (IVa), even more but smaller jumps 
will appear in the IGECME. It will develop a curved shape, as in¬ 
dicated by the dashed line (IVc), meaning that it becomes steeper 
toward the high-mass end. 

2.5. Comparing E(SER) and the star formation history (SFH) 

The distribution function of SERs, E(SER), represents all con¬ 
tributing star formation activities, but not their chronological or¬ 
der. Conseguently, it is different from the SEH, which describes 
how the SER evolves with time, meaning SEH = SER(f), and 
should not be confused with it. However, F'(SER) can be derived 
from a known SEH: The SEH has to be divided into single SC 
formation epochs of length and for each SC formation epoch 
the associated SER has to be determined. Counting how often 
(Ascff) every SER appeared reveals the distribution of SERs, 
A^scff(SER), from which F’(SER) can be calculated by applying 
Eg.|^ We show this in detail for six considered SEHs in Sect.|^ 

Eor instance, a constant SEH at a level of SER = 10 Mgyr^ 
continuing over ten SC formation epochs leads to F'(SER) be¬ 
ing a delta function at SER = 10 Mgyr ' normalized to 10. Eor 
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Fig. 3. Sketch of how different star formation activities (top panels, ’a’) influence the ECMFs (middle panels, ’b’) and thereby shape 
the IGECMF (bottom panels, ’c’) for four exemplary cases (columns I - IV) according to the superposition principle. 


comparison, a star burst lasting over one SC formation epoch at 
SFR = 100 Mgyr^^ corresponds to F'(SFR) being a delta func¬ 
tion as well, but at SFR = 100 Moyr ' and normalized to unity. 
According to Eq. [T^ both cases give the same total mass in the 
end, namely Mtot = 10-10 Mgyr^' ■ df = 1 ■ 100 Mgyr^' ■ 6t. 
However, as can be seen in Fig. cf. panel (Ic) and (lie), the 
distribution of the masses differ considerably since in the second 
case SCs of much higher masses can form due to the higher SFR. 

As shown exemplarily above, any SFH can be converted 
to an F(SFR). Conversely, the SFH cannot be derived from a 
known F'(SFR) because F'(SFR) does not reveal the chronologi¬ 
cal order of the involved SFRs. 


3. Determining the star formation duration 6t using 
the SFR-Meci,max reiation 

The SFR-Meci,max relation originates from a relation between the 
global SFR of a galaxy and the brightest SC in the V band and 
was found by Larsen ( 2002| l. Later, Adamo et al. (2011 1 inves¬ 
tigated the same properties of massive young SCs in blue com¬ 
pact galaxies and found them to lie slightly above the upper end 
of the Larsen relation. Moreover, [Randriamanakoto et al.| ( |2013| l 
observed a similar relation in the near-infrared for the brightest 
super star clusters (SSCs) in luminous infrared galaxies. 

In an analysis based on |Larsen ( 2002| l, Weidner et al.| ( |2004) l 
showed the brightest SC to be the most massive young SC in 


most cases, even though the M/L-ratio of a stellar population de¬ 
pends highly on its age. They converted the SFR-brightest SC re¬ 
lation to the SFR-Meci,max relation and analyzed it for dwarf and 
spiral galaxies. Their data (young, most massive SCs (Med,max) 
vs. current, galaxy-wide SFR, taken from |Larsen|2002 Weidner 
et al. 12004] |Larsen|2009 provided by C. Weidner, private com¬ 
munication) are replotted in Fig.|^and confirm a correlation be¬ 
tween the SFR and Med.max- A typical error estimate is plotted 
in the bottom right corner. On the x-axis, the calibration of the 
SFR as a function of the infrared flux is the main contributor to 
the error, while on the y-axis the uncertainties mostly originate 
from converting the luminosity of a SC into a mass with an as¬ 
sumed M/L-ratio that strongly depends on the age of the SC (C. 
Weidner, private communication, see also Weidner et al.|2004 1. 


Apparently, there is some spread in the data, particularly 
above the relation. These data points mostly belong to dwarf 
and irregular starburst galaxies ( |Billett et al.| |2002[ |Larsen| 
20021. Several explanations have been suggested for this offset: 
Weidner et al. (2004 1 argued that an intense star formation activ¬ 
ity in a dwarf galaxy may be halted once a very massive SC has 
formed (the ’’quenching” hypothesis). The feedback of this SC 
may heat the surrounding dense gas and prevent further star for¬ 
mation. This scenario was supported by |Bastian| ( |2008| l based on 
a study of NGC 1569, a dwarf-irregular (post) starburst galaxy. 
According to the quenching hypothesis, the outlying data points 
may be located at incorrect positions in the diagram: The SFR 
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logio(SFR[MQ yr"']) 


Fig. 4 . A^eci.max VS. galaxy-wide SFR (replotted from 


Weidner 


et al. (20041, including new data points). The curves are least- 
squares fits according to the SFR-Med.max relation (Eq. l25]l with 
the fitting parameter 6t for all p between 1.5 (steepest curve) and 
2.6 (shallowest curve) in steps of 0.1. The curves for p = 1.6, 
and 1.8 are omitted. A typical error estimate is indicated in the 
bottom right corner. 


might have dropped significantly after the formation of these 
SCs, for which reason the data points would have to be hori¬ 
zontally shifted to the right since they formed at higher SFRs 
and may lie in the area of the other measurements. 

On the other hand, for the Milky Way (MW) galaxy and the 
Large Magellanic Cloud (LMC), [Fukui et'aF (19991 suggested 
that the stronger gravitational field in the MW compared to the 
LMC leads to a stronger fragmentation of molecular clouds, 
for which reason the MW is able to form solely open SCs. In 
contrast, the weaker gravitational field of the LMC allows the 


formation of more populous SCs. More generally, Billett et al. 
( 2002| l proposed that dwarf galaxies are able to form massive 
SSCs due to the absence of shear. [Weidner et al.| ( |2010a| l in¬ 
vestigated how shear forces act on GMCs in dwarf and spiral 
galaxies. They found that the presence of shear prevents GMCs 
to collapse into dense SSCs in spirals, while in dwarfs the lack 
of rotational support allows the formation of SSCs. Thus, dwarf 
galaxies would be able to form more massive SCs than spirals 
at the same SFR and therefore lie above the mentioned SFR- 
Meci.max relation. This would be equivalent to ^ecmf( 4T) deviat¬ 
ing from the canonical form (Eq.[T5]l for some dwarf galaxies. 

To begin, all data points are included, but we examine below 
how the analysis is influenced when the four data points lying 
above the relation in Eig. |^with Med max between 10^ Mq and 
10'’ Mo and logio(SER) < 0 are excluded. These data points 
belong to measurements in NGC 1705, NGC 1569, the Small 
Magellanic Cloud, and the LMC, viewed from left to right. 
However, assuming that these data points are placed at the cor¬ 
rect positions, this would require an SC formation timescale of 
at least 10 Myr according to a simple estimate using Eq.fTS] 

The length of one SC formation epoch, 6t, was determined 
by fitting the SER-Med.max relation (Eq. [25] l to all data points 
using the least-squares method. Since 6t rm^t vary with p, a fit 
for each p was performed separately in the range from 1.5 to 2.6 
in steps of 0.1. Eigure|^ shows the fitted curves through the data 
points. The steepest curve corresponds Xop - 1.5, the shallowest 
curve iop - 2.6. Eor purposes of clarity, the curves belonging to 
P - 1.6, and 1.8 are omitted. 


Table 2. Duration of one SC formation epoch, 6t, and the re¬ 
duced as determined from the least-squares fits in Eig.|^for 
each p (column 1). In columns 2 and 3 all data points are used 
while in column 4 and 5 the four data points lying above the re¬ 
lation inEig.|^(with 10^ Mq < Med,max ^ 10® Mq) are excluded. 
All values are visualized in Eig|^ 
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1.5 
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0.56 
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1.8 
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0.81 

0.223 

1.9 
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1.24 

0.219 

2.0 
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0.216 
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0.218 
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0.475 
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0.230 

2.3 
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0.524 
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0.255 

2.4 

38.77 

0.591 
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55.19 
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Fig. 5. Duration of one SC formation epoch, 6t, (upper panel) 
and the reduced (lower panel) as determined from the fit¬ 
ting in Eig.|^with (crosses) and without the four outliers (filled 
circles) for each p. The values can be found in Table 
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Clearly, 6t increases with increasing p, ranging from 0.4 Myr 
to 186 Myr (Table column 2), as visualized by the crosses in 
the upper panel of Eig. A physical explanation might be that it 
takes longer to populate an ECME with a large p than an ECME 
with a small p due to the larger number of SCs for a given Mmax 
(cf. Eq. 16 1 . In addition, the reduced values are extracted 
from thent analysis (Tabled column 3) and indicated by crosses 
in the lower panel of Eig. p| Regarding x^g ^7 the most probable 
combinations of p and 6t are those for p < 2. Toward larger p, 
X^g^ increases but does not exceed unity. 

Eor comparison, the same fitting analysis is applied to the 
SER-Meci,max d^ta set (Eig. excluding the above mentioned 
four data points lying above the relation. The resulting length 
of one SC formation epoch, 6t, can be viewed in column 4 in 
Table [^ and is represented by filled circles in the upper panel 
of EigTlM It emerges that the values for 6t are somewhat lower 
than the previous fitting results (cf. columns 2 and 4 in Table 
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Fig.ig. This is anticipated since 6t can be found in the denom¬ 
inator of the SFR-Meci,max relation (Eq. 25 1 . Consequently, an 


increasing 6t induces the htting function to shift downward in 
Fig. 1^ However, fitting Eq. to the data without the outliers 
already places this function slightly below the previous fits since 
the outliers lie above the actual relation. Thus, dt is smaller than 
before. Moreover, the removal of the outliers leads to a smaller 
spread in the remaining data set and therefore to lower reduced 

values than in the previous htting. This becomes clear from 
the lower panel of Fig. where the obtained values are 
marked with hlled circles, and from comparing columns 3 and 5 
in Table |2] 

The theoretical star formation timescale of about 10 Myr fol¬ 
lows from the calculation of the Jeans time in molecular clouds 


(e.g., Billett et al.| 2002 


& Kroupa 2007| l. In simu 


Weidner et al. 2004 

|Maschberger 

ations performed by 

Bonnell et al. 


ing cloud disperses on a timescale of 10 Myr. 

There have been various attempts to estimate the timescale 
of SC formation observationally: From analyzing GMCs in the 
EMC, [Fukui et al. ( 1999| l estimated SC formation to proceed in 
a few Myr and a typical lifetime of a molecular cloud of about 
6 Myr. Fikewise in the FMC, [Yamaguchi et al.| ( [200T| ) found that 
SCs are actively formed over roughly 4 Myr and the host molec¬ 
ular clouds completely dissipates in about 10 Myr after the onset 
of SC formation. Another approach to observationally estimate 
the SC formation timescale is to measure the offset between Ha, 
emitted through recombination of hydrogen ionized by newly 
born massive stars, and CO, a tracer of molecular gas which is 
observed in star-forming spiral arms. Star formation times de¬ 
rived in this way by Egusa et al. (2004 2009 1 range from 4 to 
28 Myr, whereas half of the measurements lie between 11 and 
14 Myr. Similarly, [Tamburro et al. ( 2008| l compared images of 
spiral galaxies in HI from cold gas and 24 fim from warm dust 
heated by UV and find shorter timescales between 1 and 4 Myr. 

By combining the results from theory, simulations, and ob¬ 
servations, we hnd agreement that the formation of an SC pop¬ 
ulation through the formation and the dispersal of their birth 
molecular clouds occurs galaxy-wide on a timescale between at 
least a few Myr and at most a few 10 Myr. Our fitting results for 
6t, the duration of one SC formation epoch, which match these 
estimates, are highlighted in light gray, while the most proba¬ 
ble values are shaded slightly darker in Table ^ Additionally, 
observed values for fi, the index of the ECMF, are presented 
in light gray, while the values found most frequently are high¬ 
lighted somewhat darker. As one can see immediately, the col¬ 
ored entries in the two columns overlap over almost the full 
range. This demonstrates the reliability of the analytically de¬ 
rived SFR-Meci,max relation (Eq. l25]l since it naturally connects 
- without any adjustment - the empirical estimates of the two 
independent quantities fi and St in combination with the SFR 
vs. Med,max data from|Weidner et al.|(|2004|). Note that this find¬ 
ing is virtually independent of whether outliers are excluded or 
not (cf Table 1^. 

Since there has not been a definitive statement about the four 
outliers and the outliers do not change the results much, none of 
them is excluded. Thus, for all further calculations we use the 
values for St as given in column 2 in Table However, it is not 
expected that the further analysis will depend much on whether 
the outliers are excluded or not since in both cases the values for 
St are similar and increase with y6 in a similar way (cf. columns 2 
and 4 in Table Fig. |^. Moreover, we assumed that the SFR- 
Med,max relation, extrapolated to higher values, holds true. 


4. Considered star formation histories (SFHs) 

An SFH reveals how the SFR evolves with time, that means 
SFH s SFR(f). Various observations suggest that on cosmologi¬ 
cal scales, the SFR was higher in the past or for higher redshifts, 
z, than it is today (e.g., Schiminovich et al. 2005 [ Fe Floc’h et al. 
2005 1 see also references therein), meaning that SFR(f) must de¬ 
crease with time. For instance, this was observed for the MW by 
Kroupa| ( |2002| l, for the individual ga l axies N GC 584, NGC 3377, 
and NGC 3610 by Georgiev et al.|(|2012 l and for a sample of 
massive galaxies by Daddi et al. ( 2007| l. 

The simplest and most often considered SFH is a purely ex¬ 
ponential SFH. It is characterized by an SFR that has its maxi¬ 
mum at the beginning and exponentially decreases thereafter. A 
’’delayed-exponential” SFH was suggested by Sandage ( 1986) 1. 
Starting with SFR = 0, the SFR increases to a maximum and 
exponentially decreases thereafter. Field galaxies show a simi¬ 
lar development in the SFR between a redshift z ~ 4- and today, 
z - 0, with a peak at z ~ 1.5 (Madau et al. 1998|l. Th e two 
described exponential SFHs were used by Gavazzi et al. ( 2002| l 
and are considered here as well. 

Moreover, we also examined four power-law SFHs, which 
were chosen arbitrarily. They must have a similar evolution with 
t as an exponential SFH, namely starting at a relatively high SFR 
value and monotonically decreasing from there. Also, they were 
required to be easy to handle. Thus, the power-law SFHs have to 
be of the form SFR(f) cx r’^. To investigate different indices, rj = 
0.5, 1, 2, and 3 were considered. Interestingly, Filly et al. ( 1996| l 
found that the comoving luminosity density of the Universe can 
be reasonably well described by a power-law SFH with rj — 2.5. 

The examined SFHs and their individual normalization con¬ 
stants, c, are 


Exponential SFH, 


SFR(f)=—exp-; Cexp = M, 


Texp parameterizes how fast the SFR decreases with t. 


(34) 


Delayed-exponential SFH, 


SFR(f) = Cdei— exp 

"^del 


t fi \ 


V ^' del ^ 


5 ^del “ M^ot, (35) 


Tdei parameterizes at which time the SFR reaches maximum. 


Power-law SFH with rj - 0.5, 

SFR(f) = C0.5 ; C0.5 = 

Power-law SFH with rj - 

SFR(f) = Cl ; Cl = 


2 V^Hubble 




(36) 


In fHubbie - In St 


Power-law SFH with tj -2, 

SFR(f) = C2 ; C2 = Mtot ■ St, 


(37) 


(38) 
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Power-law SFH with rj -3, 

SFR(f) = C3 ; C3 = 2 Mtot ■ 6t^. (39) 

All the above SFHs are normalized such that the integration 
of the SFH (= SFR(f)) yields the total stellar mass, Mtot, of all 
SCs that ever formed; 


Mtot — 




SFR(f) df. 


(40) 


This normalization criterion cannot be applied to some of the 
SFHs because the lower, f = 0, and/or the upper limit, f = oo, 
would lead to a diverging integral. If this was the case, the lower 
and/or the upper limit were replaced by f = df and/or f = fnubbiej 


respectively, in Eq. 40 For the exponential and the delayed- 


exponential SFH, the parameters Texp and Tdei, respectively, were 
chosen to be 1 Gyr, which means that the predominant star for¬ 
mation activity concentrates on the first few Gyr, 


Texp = Tdei = 1 Gyr. (41) 

We assumed that over a Hubble time, SCs are formed with a 
total stellar mass of 


Mtot = 10'° Mo. (42) 

This amount is compatible with the total mass stored in SCs 
systems in today’s galaxy clusters. For instance, a simple esti¬ 
mate from the mass distribution of GCs and ultra-compact dwarf 
galaxies (UCDs) in the Fornax galaxy cluster by |Hilker| ( |2009| l 
leads to approximately this mass. However, without loss of gen¬ 
erality, one can assume any value for Mtot. The assumed mass 
estimate does not necessarily imply that the assembly of all these 
SCs will host the same amount of mass today. Since evolutionary 
effects are not taken into account here, mass loss through stellar 
evolution does not occur. In real SCs, massive stars lose gas due 
to stellar winds or supernova explosions, which becomes avail¬ 
able again for later generations of SCs. Thus, even if all SCs 
ever formed had a mass of 10*° Mq at their birth in total, the 
agglomeration of them will be significantly less massive today 
since SCs lose about 90 % of their birth stellar mass within sev¬ 
eral Gyr, while those with Mgci <10"* Mq dissolve completely. 

5. Determining f (SFR) from the considered SFHs 

According to the following procedure, the distribution functions 
of SFRs, F’(SFR), were calculated from the considered SFHs: 

1. Determination how many SC formation epochs occurred: 
Each SEH (Sect. is interpreted to be a sequence of single 
SC formation epochs and is divided into A^scfe = fHubbie/<^f 
epochs of length 6t as sketched in the left panel of Eig. 

2. Calculation of the average SER for each SC formation epoch: 
The SER is evaluated at the beginning and at the end of each 
formation epoch. Moreover, the integral of the «-th SC for¬ 
mation epoch 

^(n+l)-St ^(n+K)-Sf ^(n+l)-St 

I SFR(f)df= I SER(f)df-H I SER(f) df, 
\Jn-6t Jn-dt K)(n+K)-6t 

^1 I 2 

(43) 

is divided into two integrals and the limit (n + k) ■ 6t is cho¬ 
sen such that the two integrals are equal, that is, Ii - h- 



Fig. 6. Sketch of how F’(SER) is determined from a SEH: SER(f) 
(left panel, here an exponentially declining SEH, Eq. [3^ is di¬ 
vided into single SC formation epochs of length St. An average 
SERav is determined for each of the A^scfe formation epochs. 
Erom this, a histogram (right panel) is created in which every 
SERav has its own bin of height unity. F’(SER) for each SERav is 
determined by dividing unity by the width of the corresponding 
bin, ASER. 


This ensures that Eq. 18 is fulfilled for each SC forma¬ 


tion epoch. Then the average SFR of the n-th SC forma¬ 
tion epoch is evaluated for each SEH (Eqs. - [39] l from 
SERav,« = SER((n -H k) ■ St), k is individually determined for 
each SC formation epoch. 

3. Creating a histogram Ascee(SFR): Such a histogram reveals 
how often every SFR occurred. 

If an SFH is a monotonically decreasing function, each 
SFRav,« occurs once. In this case, for each SC formation 
epoch, one bin of height unity is created at SFRav.n. The 
width of the bin is defined by the SFRs at the beginning and 
at the end of that SC formation epoch. This is illustrated in 
the right panel of Fig. If the x- and y-axis are exchanged, 
the histogram Ascee(SFR) is depicted in the usual manner. 

If an SFH is not a decreasing function, SFRav,« are sorted ac¬ 
cording to their value. To create the same number of bins 
as there are SFRav,« values, the upper and lower limit of 
the n-th bin is computed from taking the arithmetic aver¬ 
age between the SFRav.n and the previous and the next value: 
(SFRav,n -H SFRav,n±l)/i 

Pursuant to this notion, A^scee bins of height unity are ob¬ 
tained without any gaps in between. This enables determin¬ 
ing F'(SFR) with the highest possible precision because the 
number of bins in the histogram is equal to the number of SC 
formation epochs. 

4. Calculation of F'(SFR): For each SFRav.n, unity is divided by 
the width of the n-th bin, ASFR, according to Eq. Since 
Mmin = 5 Mq, only those SERs are considered that lead to 
the formation of SCs more massive than 5 Mq. 


Eor the six considered SEHs from Sect. [4| (Eqs. [^-[^ with 
Eqs.[^-|42|), F'(SER) is determined as described above. This is 
done for all p of Table [^because the results may vary with St. 
Interestingly, the essential properties of F’(SER) do not change 
much with p, and thus F'(SER) of the six SEHs is shown only 
for p — 2.0 in black and for p - 2.3 in red in Fig. In all 
cases, F'(SFR) is composed of many individual data points, each 
of them corresponding to one SC formation epoch. Error bars 
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39 


with Eqs. 


41 


Fig. 7. F(SFR), the distribution function of SFRs, of the two exponential and four power-law SFHs (Fqs. 

[4^ for p - 2.0 (black) and p - 2.3 (red). Fach data point corresponds to one SC formation epoch. Error bars are indicated if they 
are large enough to display. Except for the delayed-exponential SFH, all F(SFR) can be exactly htted by a power law according to 
Eq. 44 The corresponding index y and the considered SFH are given in the corner of each panel. The panels are arranged such that 
y increases from the upper left to the lower left by viewing in clockwise direction. (A color version of this figure is available in the 
online journal.) 


are indicated if they are larger than the size of the used plotting 
symbol. 

It becomes immediately apparent that all F(SFR) - except 
for a delayed-exponential SFH, which is discussed below - are 
basically straight lines in the double logarithmic representation. 
This means that the quantity SFR itself is distributed according 
to a power law independent of whether the SFH (= SFR(f)) is an 
exponential or a power law. Power laws according to 

logio(F(SFR)) = -y logio(SFR) + g 
« F(SFR) = 10^ ■ SFR^^, 


per right comer of each panel - increases if the panels are viewed 
in clockwise direction, starting at the upper left panel and ending 
at the lower left panel. In Table the hve SFHs whose F'(SFR) 
follow a power law and their respective index y are collated. In 
addition, the four power-law SFH indices t] are plotted against 
their respective F(SFR) indices y in Fig. Interestingly, these 
data points can exactly be fitted by the function y - I + l/rj. All 
this indicates a fundamental relation between the type of SFH 
and the index of the its F'(SFR). 


Table 3. SFH and the corresponding index y of F(SFR) 


were htted to these F(SFR), from which the htting parameters 
y and g were determined. Figure visualizes a slight vertical 
shift between F(SFR) obtained for p — 2.0 (black) and p — 2.3 
(red), meaning that the y-intercept g changes with p, but the 
slope -y does not. Even if the normalization constants are var¬ 
ied by choosing a higher or lower total mass, Mtot, (Eq. |40| or 
another value for the parameter Texp (Eq. 41 1 in the exponential 
SFH (Eq. [34| ) - the values of y are unaffected. In Fig. |^the six 
panels containing the resulting F(SFR) of the six SFHs are ar¬ 
ranged such that the index y of F'(SFR) - as indicated in the up¬ 


SFH 

7 

exponential 

1.0 

power law, r) = ?! 

1.33 

power law, r) = 2 

1.5 

power law, r) = \ 

2.0 

power law, rj = 0.5 

3.0 


Figure shows that the F'(SFR) of the delayed-exponential 
SFH again increases toward the high-SFR end. This feature is 
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Fig. 8. F'(SFR) index y as a function of the index rj of the power- 
law SFHs. The indicated curve, y = 1+ !/?], exactly matches the 
data points. 


caused by the shape of the delayed-exponential SFH; The SFR 
increases first and then decreases, so that relatively high SFRs 
appear considerably more often than they do in a purely expo¬ 
nential SFH, which is a decreasing function at any time. As a 
result of the large number of SC formation epochs at high SFRs, 
F(SFR) at high SFRs is enhanced. After the delayed-exponential 
SFH has reached its maximum, it decreases exponentially in the 
same way as a purely exponential SFH. Within the regime of low 
SFRs, F(SFR) can thus be described by a power law as well that 
has almost exactly the same index y as F(SFR) of a purely expo¬ 
nential SFH. Therefore, the panel containing the T'(SFR) of the 
delayed-exponential SFH is placed to the right of the one with 
the purely exponential SFH. 

Indeed, the relation between t] and y is fundamental and can 
be derived analytically in the following way: T'(SFR) describes 
how the number of SC formation epochs, AfscFE. change with 
SFR. Since one is confronted with discrete numbers here, Eq.p^ 
needs to be discretized. 


F(SFR) = 


AAfscFE(SFR) 

ASFR 


1 

|SFR'(f)| -df' 


(45) 


The last equality becomes clear using the sketch in Fig. 
Each SER occurs once during the SC formation episode, that 
is, AAscee = 1 ■ The change in SER per unit time is by definition 
SER'(f) = dSER(f)/df, so that ASER = |SER'(f)| df. The absolute 
value of SER'(f) prevents T'(SER) from being negative since the 
considered SER(f) are decreasing functions. Moreover, SER(f) 
needs to be a monotonic function. Otherwise, there would exist 
a maximum or a minimum turning point so that SER'(f) would 
become zero and Eq. undefined. Therefore, this argumenta¬ 
tion does not apply for the delayed-exponential SEH. As a result, 
in this case, T'(SER) is not obtained analytically. 

The derivatives of the exponential and of a general power- 
law SEH are 


SER'(f) = -^^exp(-^1 = ——■ SER(f), (46) 

■^exp \ Axp/ Texp 

SER'(f) = -Tjcr’’-^ = -c(cr'')^ = -c(SER(f))^""’. (47) 


In Eq. 47 SER'(f) is rearranged such that SER(f) appears again, 
while all constant factors are combined to c. Inserting Eq.[46|and 
Eq.l^ respectively, in Eq.|45]gives 


F(SER) = 


^exp 

SFR(r) ■ 6t 


^exp 

SFR(0 


logic (T'(SER)) = logio(cexp) - logio(SER(f)), 


(48) 


F(SER) = 


c(SER(f))‘^’ -dt 


(SER(f))'^’ 


logio (F(SER)) = logio(cpow) - 1 + - logio(SER(f)). 


1 


7 


(49) 


One can immediately extract from Eq. that T'(SER) derived 
from any exponential SEH always has a slope of -y = -1. 


Deriving T'(SER) from a power-law SEH (Eq. 49 1 leads to a 
slope of -y = -(1 - 1 - 1 / 77 ) in exact agreement with the above 
finding. 

Because this derivation is not applicable to the delayed- 
exponential SEH, the corresponding F(SER) is not a straight line 
in a double logarithmic plot, meaning that it cannot be described 
by a power law, as already became clear from Eig. 


6. Deriving the iGECMFs from the obtained F(SFR) 


Now we can determine the IGECMEs from the derived T'(SER) 
by evaluating Eq. 28 Eor this, we transformed F(SER) to 
FiMmsa)- This was done by converting the argument (SER, x- 
axis in Eig.|^ to M^ax according to Eq. 19 The value of T'(SER) 
is not affected by this rescaling. Then, we integrated Eq. 28 over 
all possible M^ax from s Mmin = 5 Mq up to an individual 
MXx that corresponds to the maximum SER of the considered 
SEH. As with F(SER), the resulting IGECMEs for each SEH 
look very similar for different (3. Thus, we show in Eig. |^only 
the IGECMFs for p - 2.0 in black and for p - 2.3 in red. The 
resulting IGECMFs for one p share several similarities that are 
independent of the underlying SFH: 


• The starting position at the low-mass end is identical. 

• The shape of the lower mass part can be described by a power 
law with the slope of the underlying ECMF, represented by 
a dashed line in Fig. 

• The occurrence of a turn-down at the high-mass end: The 
IGECMFs become increasingly steeper toward the high- 
mass end, as compared to the parental ECMF. The IGECMFs 
are reminiscent of a Schechter function, which has a power- 
law behavior at the low-mass end and an exponential turn¬ 
down at the high-mass end. 

The turn-down shows several differences: 

• The mass at which the bending down begins: This position is 
only especially distinct for the IGECMF resulting from the 
power-law SFH with 77 = 0.5. For all other IGECMFs, this 
point is not prominent because of a smooth transition. 

• The upper mass limit: It is directly related to the maximum 
SFR (Fig. 1^ of the considered SFH through Eq. 

• The individual shape of the turn-down. 

A comparison of the IGECMEs for p - 2.0 (black) and p - 
2.3 (red) (Eig. 0 shows that only the relative positions of the 
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Fig. 9. IGECMFs for (3 — 2.0 (black) and fi — 2.3 (red) showing the mass distribution function after a Hubble time of SC formation 
according to the SFH indicated in the corner of each panel. Strictly speaking, these IGFCMFs represent how the birth stellar masses 
of SCs are distributed since evolutionary effects leading to a change in the SC mass are not taken into account. Shifted to higher 
or lower values, the dashed lines represent the underlying FCMFs in the same color for comparison. According to the underlying 
SFHs, the IGECMFs are arranged in the same manner as in Fig.[^ (A color version of this hgure is available in the online journal.) 


starting point at the low-mass end, the position where the turn¬ 
down appears, and the upper mass limit are shifted. The reason 
is that the SC masses are distributed differently depending on 
(3: The SFR-Med.max in Fig. shows that at a constant SFR a 
small (3 corresponds to a large Med,max vice versa. This is 
directly reflected by the IGECMFs: For (3 - 2.0, the IGECMFs 
continue up to a higher upper mass limit than the IGECMFs with 
P - 2.3 (cf. Fig.|^. However, the same total mass, Mmi, is stored 
in each IGECMF since all SFHs are normalized to the same Mtot 
(Eq. |40| ) independent of p. To compensate for the higher upper 
mass limit, the low-mass tail of the IGECMFs with p - 2.0 is 
slightly shifted to lower values relative to the IGECMFs with 
P - 2.3, which is best visible at the low-mass end (cf. Fig. |^. 
Thus, assuming the same Mmi, there will be fewer high-mass 
SCs and more low-mass SCs for an IGECMF with a large p than 
for an IGECMF with a small p - exactly the same applies to the 
ECMF. The IGECMFs of all other p exhibit the same features as 
listed above and are thus not shown here. 

The occurrence of an explicitly visible bend in the IGECMF 
- as the one for the power-law SFH with tj - 0.5 - is the re¬ 
sult of the underlying distribution of SFRs: The mass at which 
the bend appears corresponds to the upper mass limit, of 

the lowest SFR (Eq. [T^. All SFRs contribute to the formation 


of SCs with masses below the mass at which the bend appears, 
which is why the IGECMF in this mass range has the same slope 
as the parental ECMF. Toward the high-mass end, a decreasing 
number of FCMFs contribute to the IGECMF since high SFRs 
become rarer. Thus, the IGECMF becomes increasingly steeper 
toward the high-mass end. For IGECMFs where a bend is not 
explicitly visible - as in all other cases - this explanation holds 
true as well. 


The characteristic shape of the IGECMF - a turn-down at 
the high-mass end and a power-law behavior in the mass range 
below - is observed in the IGIMF (e.g., |Kroupa & Weidner 


2003 [ Weidner & Kroupa||2005[) for the same reason (Pflamm 


Altenburg et al.|2007|l, which means that all SFRs contribute at 


the low-mass range, while only the highest ones - which are 
rare - contribute to the high-mass range. This has a fundamental 
consequence: For an invariant ECMF, a mass distribution func¬ 
tion accumulated over several formation epochs can only be¬ 
come steeper - if high SFRs are rare - or remain unchanged 
- if high SFRs occur frequently - toward the high-mass end, but 
it can never become shallower than the parental ECMF. Thus, 
the slope of the mass distribution function at the high-mass end 
gives clues on the slope of the parental ECMF: The underlying 
ECMF can be shallower or have the same slope as the observed 
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mass function, but it cannot be steeper. Before the slope can be 
determined, however, it first has to be considered how stellar and 
dynamical evolution altered the mass function. 


7. Deriving the cumuiative star ciuster mass 
distributions from the obtained iGECMFs 

Usually, a discretized SC mass distribution is observed for SCs 
in or around a galaxy or a galaxy cluster and not a mass distribu¬ 
tion function like the IGECMF. For the former, light sources are 
selected that presumably are SCs, and their luminosity is con¬ 
verted to a mass with an assumed M/L-ratio. From the individ¬ 
ual mass estimates, a histogram is created that reveals how many 
SCs were found in a certain mass range. 

Therefore, we calculated the mass distributions of SCs from 
the obtained IGFCMFs. As a byproduct, this enables us to test 
how elaborate the IGECMF concept is. We performed this com¬ 
putation in two different ways: For each SFH, we derived the 
cumulative distributions of SC masses, which is the number of 
SCs above a certain mass, Afeci(> from 

a) the obtained IGECMFs (Sect.|^, 

b) the distributions of SFRs as determined in Sect. |^(step[^in 
the enumeration). 

This calculation was restricted to the high-mass end of the SC 
distribution since only the most massive SCs are detected by 
observations. Using these two approaches, we determined the 
masses of the 20 000 most massive SCs and compared their dis¬ 
tributions. Ideally, both approaches should result in the same 
mass distributions, which would lead to the conclusion that 
the IGECMF correctly describes the distribution of SC masses 
formed during many SC formation epochs in which SC masses 
were distributed according to the ECMF (Eq. [Tg. 

For a) the procedure is as follows: Starting at the highest 
Mtnax. each of the obtained IGECMFs is integrated downward 
up to an individual until the condition in Eq. is fulfilled. 
The mass of the heaviest SC is determined by integrating Eq.[3T| 
within the same limits. According to Eq.|^ m 2 is the upper inte¬ 
gration limit of the second massive SC: Now, a limit m 3 is found 
so that Eq. holds true and Eq. is evaluated to obtain the 
mass of this SC. This algorithm is repeated to generate 20000 
SCs in total. 

For b) this is done in the following way: 

1. Determination of Mmax^ For each SFH the highest SFR (ob¬ 
tained in Sect. 1^ step in the enumeration) is selected and 
converted to Mmax according to Eq. Since the inverse 
function of Eq. [^cannot be derived analytically, M^ax is 
found numerically. Knowledge about M^ax fully determines 
the ECMF (cf. Eqs.[^and[g. 

2. Calculating the number of SCs and their masses: With mi = 
Mmax^ is evaluated from Eq. These integration limits 
are used to determine the mass of this most massive SC with 
Eq. |7] The found lower limit m 2 is the upper limit for the 
second massive SC (cf Eq. from which the next lower 
limit m 3 can be determined with Eq. Using these limits, 
the mass of the second massive SC is calculated (Eq.|^. This 
procedure is continued until the masses of the 20000 most 
massive SCs are evaluated. 

3. Selection of the 20000 most massive SCs: Steps □ and 1 ^ 
are repeated for the second, third, fourth, and so on highest 
SFR. Out of all calculations, the 20000 most massive SCs 
are selected. 


Both calculations were performed for each of the six SFHs 
for each y 6 . The resulting cumulative distributions of SC masses, 
Aeci(> Med), are presented in black for fi - 2.0 and in red for 
/? = 2.3 in Fig. [T^ The continuous lines represent the distribu¬ 
tions obtained from the integration of the IGECMF (a) while the 
dashed lines belong to those from integrating the ECMF sepa¬ 
rately for the SC formation epochs with the highest SFRs (b). 
The distributions are plotted such that the lines increase by unity 
at the mass of each SC. The dotted lines show the theoretical 
cumulative SC mass distributions from the ECMF with the same 
P for one single SC formation epoch for comparison. Since the 
ECMF is assumed to be a pure power law, these mass distribu¬ 
tions do not exhibit any steepening toward the high-mass end. 
To omit overlaps, they are shifted to higher values. 

Two features become immediately apparent: 


• Apart from some differences (see below), the cumulative SC 
mass distributions obtained from a) and b) are very similar or 
even indistinguishable. This confirms that it almost does not 
matter whether one samples from the IGECMF (a) or from 
individual ECMFs superposed according to the SFH (b). 

• The resulting SC mass distributions exhibit a more or less 
strongly pronounced turn-down at the high-mass end, as can 
be seen by comparing to the theoretical SC mass distribu¬ 
tions (dotted lines in Fig. [T0) i. This turn-down is explic¬ 
itly visible in the SC mass distributions resulting from the 
exponential and the delayed-exponential SFHs as well as 
the power-law SFH with 77 = 0.5, while it is less apparent 
for those resulting from the power-law SFHs with 7; > 1. 
This finding is independent of /?. Interestingly, a comparable 
steepening toward the high-mass end in the SC mass distri¬ 
bution has previously been detected observationally (IHilker 
| 200 ^ . 

The underlying F(SFR) determines how strongly the turn¬ 
down is pronounced: If high SFRs occur frequently - as for the 
exponential and particularly for the delayed-exponential SFH 
(Fig.|3 - high-mass SCs are formed frequently so that the num¬ 
ber of high-mass SCs increases fast with decreasing mass. In 
these cases, a turn-down in the cumulative SC mass distribution 
is therefore strongly pronounced; for the delayed-exponential 
SFH even more strongly than for the exponential SFH. In con¬ 
trast, if high SFRs are rare - as for the power-law SFHs with 
T] - 2, and 3 - the number of high-mass SCs increases slowly 
with decreasing mass, leading to a moderately well visible turn¬ 
down in the cumulative SC mass distribution. With decreasing 
mass, the slope of the distributions in all cases approach the 
slope of the theoretical cumulative SC mass distributions (dot¬ 
ted lines in Fig.[T 0 |). 

The main difference between the cumulative SC mass dis¬ 
tributions is that the most massive SCs produced by a) are gen¬ 
erally more massive than those obtained from b). The reason 
is the coaction between the new optimal sampling technique 
(Sect. 2.1 1 and the difference in the shape of the IGECMF and 


the ECMF at the high-mass end: For a) as well as for b), to deter¬ 
mine the most massive SC one has to start at M^^x, correspond¬ 
ing to the highest SFR, and integrate downward until the inte¬ 
gral over the IGECMF and the ECMF, respectively, yields unity 
(Eqs.[^and[ 6 ]l. Since the IGECMF increases more strongly in 
integration direction than the respective ECMF, this condition 
will first be fulfilled for the IGECMF (a) leading to a larger m 2 
than for the ECMF (b). Consequently, the IGECMF (a, contin¬ 
uous lines) produces a heavier most massive SC than the cor¬ 
responding ECMF (b, dashed lines), as can be seen in Fig. 

The differences become smaller for increasing p since a larger 
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45678 45678 45678 


logio (Med [M®]) 

Fig. 10. Cumulative mass distributions of the 20 000 most massive SCs for the SFH as indicated in the top right corner of each panel, 
computed a) from the IGECMFs (continuous lines) and b) from the ECMF separately for SC formation epochs with the highest 
SFRs (dashed lines) for /? = 2.0 (black) and [i — 2.3 (red). Note the small deviations between method a) and b). For comparison, the 
dotted lines show the theoretical cumulative SC mass distributions for one single SC formation epoch with the same ECMF in the 
same color. To avoid overlaps, these lines are shifted to higher values. According to the underlying SFHs, the panels are arranged 
in the same manner as in Fig. For instance, as indicated in the lower middle panel there are in total about 10'' 13 SCs above 

4^eci = 10^ Mq. (A color version of this hgure is available in the online journal.) 


P is connected with a steeper ECMF, which alleviates the above 
effect. However, for the mass distributions from the power-law 
SFH with rj - 0.5, the effect appears again weakly at the low- 
mass end. 

In addition, the cumulative SC mass distributions resulting 
from the power-law SFHs with rj - 2, and 3 from method 
a) are shifted to higher values over the whole mass range 
compared to b), whereas those from the exponential and the 
delayed-exponential SFH become indistinguishable with de¬ 
creasing mass. This depends on the respective F(SFR): In 
the case of the exponential and the delayed-exponential SFH, 
F(SFR) is uniformly distributed across the SFR range (Fig. 0. 
Thus, relative high SFRs occur sufficiently often in these SFHs, 
for which reason methods a) and b) produce a similar number of 
high-mass SCs. As a result, method b) catches up with method 
a), leading to mass distributions that become equal below a cer¬ 
tain mass. On the other hand, the power-law SFHs - especially 
with rj - 2, and 3 - demonstrate that the more sparsely high 
SFRs are distributed (Fig. |^, the less method b) is able to catch 
up with method a) and the stronger the separation between the 
cumulative mass distributions of SCs obtained from a) and b). In 


contrast, methods a) and b) lead to cumulative SC mass distri¬ 
butions for the power-law SFHs with rj - 0.5, and 1 which are 
nearly indistinguishable in that mass range. 

According to these hndings, we can summarize that the 
shape of the cumulative mass distributions of the 20000 most 
massive SCs is predominantly determined by the high-SFR end 
of the respective F(SFR). The small differences in these mass 
distributions obtained by sampling from the IGECMF (a) and 
from the ECMFs determined by the highest SFRs (b) are in¬ 
duced by the effect discussed above, namely that at the high- 
mass end the IGECMF increases more strongly in integration 
direction than the corresponding ECMF. 

In other words: The observed differences of method a) and b) 
demonstrate that it does matter in which order the two essential 
steps here - the superposition and the discretization - are carried 
out: For sampling from the ECMFs with the highest SFRs (b), 
first all contributing ECMFs are discretized into populations of 
individual SCs, and afterward, these populations are superposed. 
For (a), the IGECMF is first obtained from superposing all con¬ 
tributing ECMFs and then the IGECMF is discretized to deter¬ 
mine the mass distribution of all SCs. As visualized, the latter 
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leads to slightly higher masses. Nevertheless, the calculations il¬ 
lustrate that the cumulative SC mass distributions obtained with 
the two methods are very similar or even indistinguishable, lead¬ 
ing to the conclusion that the theory of the IGECMF is a suitable 
concept. However, method b) is regarded to be more accurate 
than a) since it compiles the SC masses for each individual SC 
formation epoch separately and superposes them afterward, sim¬ 
ilarly to what nature does. 

Interestingly, even if all IGECMFs look similar (cf Fig. [^, 
the cumulative SC mass distributions are shaped differently for 
different SFHs, but are very similar if obtained from the same 
SFH (cf. Fig. 10 1 . Thus, each SFH leaves through its F(SFR) 
an imprint on the mass distribution of SCs independent of /?. 
This circumstance is particularly important since it potentially 
provides the opportunity to deduce past star formation activities 
from the high-mass end of the mass distribution of an SC sample. 

The derived cumulative SC mass distributions (Fig. 10 1 can¬ 
not be directly compared to observed mass distributions of SCs. 
The reason is that the former describe hypothetical mass distri¬ 
bution of newly born SCs, which emerges from the superposi¬ 
tion of SC populations that formed during many SC formation 
epochs. In contrast, the latter represent today’s SC mass distribu¬ 
tion, whose SC have undergone various alterations during their 
lifetime. The mass of an SC changes with time because of stellar 
evolution and the loss of stars induced by either dynamical evo¬ 
lution or the tidal held, which may even lead to a complete disso¬ 
lution of the SC especially for low-mass ones (e.g.,|Baumgardt 


& Makino |2003[ Earners et al. 2005a 2010[ [Brockamp et al. 


2014|l. We here included none of these effects, but they will be 


accounted for when observational data are analyzed. 


8. Observability 

Even if the presented model is developed to describe the mass 
distributions of SCs formed over a Hubble time in a galaxy clus¬ 
ter, the superposition principle itself is independent of the en¬ 
vironment. Thus, this model is in principle applicable to any 
sample of SCs, for example, around or in individual galaxies, 
and for SC formation timescales of arbitrary length. Since the 
superposition principle implies the development of a turn-down 
at the high-mass end in the mass distribution of SCs if the SFR 
changes with time, the question arises whether such a turn-down 
can be observed and if so, how reliable the interpretation of a 
turn-down would be. As mentioned before, we here derived the 
distribution of the birth stellar masses of SCs, which are different 
from observed SC mass functions since changes in mass during 
the lifetime of an SC are not taken into account. 

Thus, the masses of all individual SCs in the selected sample 
need to be corrected for mass loss since their birth. This requires 
an accurate estimate of the age for each SC, which then allows 
computing the mass loss that is due to stellar and internal dynam¬ 
ical evolution. According to their age, the upper mass limit needs 
to be calculated below which SCs are expected to have dissolved 
since their birth. It is difficult to accurately estimate the number 
and mass distribution of completely dissolved or disrupted SCs 
- for instance, by infant mortality - therefore it is necessary to 
set a lower mass limit at the high-mass end down to which SCs 
in the sample are considered. This mass limit should be well 
above the dissolution limit to avoid the difficult mass correction 
for already dissolved SCs and thus the introduction of substan¬ 
tial uncertainties at the lower mass limit of the considered SC 
sample. The observational completeness limit might restrict the 
allowed SC mass range even more strongly, while the availabil¬ 
ity of more parameters such as the metallicity or the abundance 


of different elements might give further constraints on the age 
and the mass-loss history. 

In addition to stellar and internal dynamical evolution, each 
SC will suffer from mass loss that is due to the gravitational pull 
of the tidal field during its orbit. One can estimate how strong 
the tidal held acted on the individual SC and how much mass 
was lost since its birth if the position of each SC and its orbit is 
known. However, the actual mass loss in a tidal held highly de¬ 
pends on the geometry of the orbit (e.g., |Baumgardt & Makinc^ 
|2003 1, which is usually unknown and very difficult to determine. 
In this case, one could at least estimate for a statistical distribu¬ 
tion of orbits how the SC mass function as a whole might have 
changed with time. This will lead to an increasing statistical un¬ 
certainty in the mass distribution. Carrying out these corrections 
for the SCs in the sample hnally leads to the mass distribution of 
their birth stellar masses. 

However, before comparing the corrected SC mass distribu¬ 
tion with theoretical mass distributions by using the method we 
presented here, we have to take into account that the theoreti¬ 
cal mass distributions were generated by using the introduced 
new sampling technique, which distributes SC masses exactly 
according to the underlying mass function to avoid introduc¬ 
ing Poisson noise. In contrast, the observed SC mass distribu¬ 
tion carries at least some statistical uncertainties that depend on 
the accuracy of the mass correction. Thus, a range of match¬ 
ing SFR distribution functions, F(SFR), is expected rather than 
one well-defined F(SFR). However, the reliability of the anal¬ 
ysis can be enhanced by a prospective sample selection; Either 
the selected SCs are of similar age since in this case one ex¬ 
pects that all effects leading to mass loss act on those SCs in 
a similar way. Alternatively, the selected sample comprises a 
large number of SCs, which reduces the relative statistical uncer¬ 
tainties. Apparently, the best would be to combine both aspects. 
Conceivable applications are listed below. 


SC formation history of the MW 

The MW, as many other galaxies today, is rather in a quiescent 
mode of SC formation. Thus, recent SFRs are usually moderate, 
leading to the formation of SCs that only extend to intermediate 
masses - unless the SC formation activity is pushed as a result 
of an interaction with another galaxy, for instance. The MW has 
an SFR of about 2 Mo/yr (e.g., Chomiuk & Povich||2011 


see 


also references therein), while the mos t massive young SCs typ 
ically have masses below 10^ Mq (e.g., Figer et al. 2006 |Mengel 


|& Tacconi-Garman|2007[[Davies et al.|2007| l. However, there are 

different estimates for the lifetimes of such SCs: IFamers et al.l 
( 2005a|b| l expected from their semi-analytical model an SC with 
a mass of 10"^ Mq to dissolve in about 1 Gyr, while collisional 
A-body computations of Baumgardt & Makino (2003 1 showed 
that an SC with a; 3 x 10"^ Mq at solar position in the MW will 
survive about a Hubble time. Apparently, these two estimates 
alone cannot pin down how far in the past the SC formation his¬ 
tory can be constrained using the most massive SCs in the MW. 
Moreover, this example shows that due to a limited total number 
of SCs available for the analysis, no very robust result for the SC 
formation history is to be expected. 


SC formation history of a galaxy cluster 
As indicated above, the optimal SC sample should comprise a 
large number of objects of similar age. Such a SC sample can 
be found in galaxy clusters like the Fornax galaxy cluster: It 
is expected that numerous SCs and in particular very massive 
ones were formed in interactions of individual galaxies during 
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the early assembly of the host galaxy cluster. At least the most 
massive SCs are able to survive over a Hubble time - and are 
observed today as GCs and partly as UCDs, most of them hav- 


ing an age of the order 10 Gyr (e.g., |Eorbes et al.|2001| 

Kundu 

et al. 

2005 

Hempel et al.|2007| Chilingarian et al.|201 1| 

"rancis 

et al. 

2012 

1 . Eortunately, the very massive SCs are least affected 


by changes in their mass during their lifetime compared to low- 
; SCs (e.g.,|Baumgardt & Makino 2003 i. Thus, it is expected 


that a turn-down in the mass function, if it exists after the birth of 


those SCs, should be preserved. This is indeed the case: HiUcer 
(2009 his Fig. 4,) reported a steepening of the mass function 


toward the high-mass end, where about 1 000 SC-like objects 
are found with masses above 10® Mq (M. Hilker, private com¬ 
munication). Most of them are GCs, while the heaviest objects 
are UCDs, which are compatible with being very massive GCs 
( Mieske et al.||201^ . This demonstrates that such rich GC sys¬ 
tems represent an ideal, statistically signihcant data set for the 
application of the introduced superposition principle with which 
the formation history of these GCs can be uncovered. 


9. Conclusions 

We investigated how the masses of SCs are distributed for dif¬ 
ferent SFHs in a galaxy cluster environment by superposition 
of all newly born SC populations of all SC formation epochs. 
This mass function is composed of the SCs’ birth stellar masses 
since no evolutionary changes of the mass function are taken 
into account. While deriving this mass function, we developed 
the following: 


the appearance of a turn-down at the high-mass end is caused 
by an SFH that changes with time. 


If the SFR changes with time, the mass of the most massive 
SC, Med,max, also changes according to the SFR-Med.max re¬ 
lation (Eq. 25 Fig. |^. Thus, the ECMF is populated up to a 
variable upper limit, while the low-mass end is filled with SCs 
as before. Consequently, there will be fewer very massive SCs 
per low-mass SC, for which reason the IGECME falls off more 
steeply than the parental ECME On the other hand, if the SER is 
constant over several SC formation epochs, no turn-down in the 
IGECME is expected. 

Eor each IGECME - determined by the index of the underly¬ 
ing ECME and the SEH - we computed the cumulative mass dis¬ 
tribution of SCs using the new sampling technique (Sect.|^. The 
turn-down of the IGECME at the high-mass end causes fewer 
very massive SCs to be produced per low-mass SC than for an 
extrapolated ECME with the same slope. Thus, the cumulative 
mass distributions of SCs also steepen toward the high-mass end 
(Eig.[T0|. The shape of the high-mass end even differs among the 
cumulative SC mass distributions: How strongly the steepening 
toward higher masses is pronounced essentially depends on the 
underlying SEH through its F'(SER). Erom this, we conclude: 


• Past star formation activities leave an imprint on an SC mass 
distribution: This imprint is the shape of the mass function at 
the high-mass end. 

We summarize our key conclusion as follows: 


A new analytical sampling technique without stochastic fluc¬ 
tuations was introduced as an improved optimal sampling 
method. This technique enables accurately extracting the 
number of SCs as well as their individual masses from the 
underlying ECME at the same time (Sect. US- 
The length of one star formation epoch, dt, that means, the 
time it takes to form an SC population out of the interstel¬ 
lar medium so that the ECME is fully populated (Sect. 2.1 1 , 
was determined as a function of the index [i using the SER- 


Meci,max relation (Eq. 
obtained 2 Myr < 6t 
tional constraints (Sect.j^. 


Eig. 1^. Eor 2.0 < (3 < 2.3, we 
0 Myr, in agreement with observa- 


Six SEHs (hve decreasing and one increasing in the begin¬ 
ning and decreasing afterward. Sect. were considered, which 
led to the formation of SCs of 10’° Mq in total. Each SEH was 
divided into formation epochs of length 6t (Eig. |^, and we de¬ 
termined for each SC formation epoch its corresponding SER. 
Erom the distribution of SERs, we computed for each SEH the 
distribution function of SERs, F'(SER), which carries informa¬ 
tion about the past star formation activities (Sect.j^: 

• Eor all monotonically declining SEHs used here, we found 
that the corresponding E(SER) was a pure power law 
(Fig.0. This Ending is independent of any input parameter 
and was also derived analytically. 


Based on F(SER), we evaluated the IGECME. This represents 
how the birth stellar masses of all SCs is distributed for each 
SEH (Sect. |§. Remarkably, we found that 

• all IGECMEs steepen toward the high-mass end (Eig. |^. 
This Ending is independent of any parameter such as the 
parental ECME or the underlying SEH. 


• It is expected that a mass function composed of a superposi¬ 
tion of SC populations, each described by a truncated power 
law and fully populated during a formation epoch of length 
6t, will steepen toward the high-mass end if the SER changed 
during an SC formation period longer than 6t. 


The appearance of a turn-down in an SC mass function al¬ 
lows drawing a substantial conclusion: it might reveal why the 
cluster initial mass function (CIME) has a Schechter-type be¬ 
havior ( |Schecht^|1976| l at the high-mass end, which has been 
detected by various authors (|Gieles et al.|2006b Bastian|2008 


uy v aiiwuj auuiwij |XJciJLiaii|z.\,/\,/u 

^arsen|2009l [Vansevicius et al.|2009HBastian et al.|2012a|l and 


was used to describe the distribution of SC masses (e.g., Gieles 


20091. According to our examination, the turn-down can be ex¬ 


plained by a superposition of several SC populations that formed 
during different formation epochs at different SERs. If this is 
indeed the case, then the CIME can be interpreted as a local 
IGECME since the superposition principle will work indepen¬ 
dent of the environment. Consequently, it is not surprising that 
a Schechter-type function oEers a better description than a pure 
power law. 

Additionally, given our Endings, it appears natural that the 
exponential truncation mass. Me or M*, varies among diEer- 
ent SC ensembles and environments (cf. Gieles et al.||2006a|bl 
2009) IBastian et al.| |2012at see also 

max relation. 


Bastian|J200^ Larsen 


Portegies Zwart et al. 


2010 |l: Using the SER-Meci,n 
the mass range containing the turn-down can be converted into 
an SER range to determine how strongly the SER changed in 
the past during the formation of the considered SC ensemble - 
which will be unique for each environment. 

A turn-down in the SC mass distribution is only expected if 
the SER changes with time. If SCs are formed over several SC 
formation epochs at a constant SER, the power-law behavior of 
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the SC mass function will remain unchanged. If the mass func¬ 
tion of an SC ensemble does not show any tendency of a turn¬ 
down, this then indicates that the SCs have formed at a more or 
less constant SFR. 

In addition to the Schechter-type behavior of the CIMF, a 
turn-down or bend in the mass distribution was observed by 
Hilker ( 2009|l for GCs and UCDs around major galaxies in 


galaxy clusters. Our investigations provide a natural explanation 
for the observed steepening; These objects could simply have 
formed during different formation epochs at different SFRs. This 
is a quite reasonable assumption since major galaxies in galaxy 
clusters could indeed have had several interactions with infalling 
galaxies, during which SCs at different SFRs were formed. Thus, 
the superposition principle offers a reasonable physical explana¬ 
tion for the appearance of the turn-down. 

It is currently only poorly known under which conditions the 
GCs and UCDs formed. One method for deriving the formation 


history of an SC ensemble has been described in Maschberger & 


Kroupa ( 2007[) and relies on the information about the mass and 


in particular the age of each SC. Unfortunately, it is not promis¬ 
ing to apply this method to a sample of GCs and UCDs because 
these objects are assumed to be very old, and their age estimates 
suffer from large uncertainties. 

Our investigations provide an alternative approach to derive 
the formation history of an SC ensemble without the requirement 
for detailed age information: Our idea is to reverse the direction 
of thought according to the main hnding of our analysis; to use 
the high-mass end of a known GC/UCD mass distribution to de¬ 
duce which SFRs are necessary to reproduce the observed mass 
distribution. This ansatz is considered to be promising since it 
requires examining the shape of the high-mass end where the 
most massive SCs are located, which are least affected by dy¬ 
namical evolution and have the highest probability of surviving 
violent interactions with the host galaxy. Before doing that, the 
observed GC/UCD mass distribution has to be corrected for all 
possible evolutionary effects that lead to changes in the mass dis¬ 
tribution. This ansatz will be used in a future paper to derive the 
necessary distribution of SFRs from which F’(SFR) and possibly 
also the SFH can be deduced. Hopefully, this will shed light on 
the way major galaxies in galaxy clusters assembled and under 
which conditions the surrounding GC/UCD systems formed. 
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